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DETAILED ACTION 



Claim Rejections - 35 USC §103 



1 . The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

2. Claims 1-27 are rejected under 35 U.S.C. 103(a) as being unpatentable over Ladd et al 
(6269336) in view of Maes (6801604) in further view of Hedin et al (6185535). 

As per claims 1,8, Ladd et al (6269336) teaches: 

"a method of operating a speech recognition system, comprising: 55 as the 
electronic network (fig. 3) containing voice/speech recognition capabilities (Fig. 3, 
subblocks 232,234); 

"augmenting the speech recognition system with an augmenting grammar set 
supplied by a first speech recognizer of portal 55 as the electronic network (synonymous 
with the term "portal 55 - portal is defined as a central starting point for users to access a 
wide variety of applications - see applicant's specification, under discussion of the 
related art) supplies the grammar as dictated by the user to upgrade the grammar set (col. 
4 lines 32-35; col. 4 line 62 - col. 5 line 19); 
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"and notifying the portal in response to an input which corresponds to the 
augmenting grammar set" as communication node (Fig. 3, subblock 212) notifying the 
electronic network (portal) yia the VRU server that recognition is performed (col. 8 lines 
55-65). 

As per claims 1,8, Ladd et al (6269336) does not explicitly teach the execution of 
the speech recognition outside of the portal, however, Maes (6801604) teaches the use of 
a second speech recognizer outside of the portal system (col. 4 lines 45-55), wherein the 
control is temporarily passed to the speech server to process the incoming data (Fig. 7, 
subblock 8 1 , control from subblock 80 and data from subblock 82), and then control 
passed back to the controller (fig. 7 between subblock 80 and 83, col. 15 lines 1-16; also 
note col. 1 1 lines 7-20 and col. 16 lines 45-60, detailing the temporary passage of control, 
and then back to the original program that had initial control) . Therefore, it would have 
been obvious to one of ordinary skill in the art of distributed speech applications to 
modify the teachings of Ladd et al (6269336) with remote speech processing along with 
control protocols because it would advantageously temporarily shift more intense speech 
processing to adequately prepared systems and then returning to the original program 
( Maes (6801604). col. 24 line 55 - col. 25 line 46). 

The combination of Ladd et al (6269336) in view of Maes (6801604) does not 
explicitly teach transferring control to the second recognizer independent of the portal, 
however, Hedin et al (6185535) teaches transference of the recognition between different 
recognition engines without going through the portal, as well as performing the 
recognition process in the remote location (abstract, fig. 5, col. 2 line 43 - col. 3 line 25; 
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col. 8 lines 35-64; Fig. 3, sub 205). Therefore, it would have been obvious to one of 
ordinary skill in the art of speech recognition systems to modify the combination of Ladd 
et al (6269336) in view of Maes (6801604) with localized control of the recognition 
process because it would make the system more robust to handle increased volume of 
recognition requests ( Hedin et al (6185535) , col. 2 lines 26-35). 

As per claim 2, Ladd et al (6269336) teaches: 

" the speech recognition system resides at an application server remote from the 
portal" as VRU server can be configured to be separated from the electronic network - as 
a stand alone system into a LAN - col. 9 lines 1-10). 

As per claim 3, Ladd et al (6269336) teaches: 

" transferring control of a call back to the portal after notifying the portal that the 
input corresponds to the augmenting grammar set" as transferring control back to the 
voice browser (col. 14 lines 29-35). 

As per claim 4, Ladd et al (6269336) teaches: 

"transferring a call to another application server which corresponds to the input" 
as transferring call control to a different (and appropriate server); (col. 8 lines 35-45 - 
Ladd's call control transfers the call's according to input (either pages or message) and 
routes items either thru a paging network or email network). 
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As per claim 5, Ladd et al (6269336) teaches: 

"directing the remote application server to perform one of a fixed set of pre- 
determined actions on behalf of the portal in response to a predetermined input" as 
application server (Fig. 3, subblock 242) instructing the VRU server to perform basic 
recognized functions such as automatic speech recognition (ASR's), text to speech 
(TTS), etc., (col. 9 lines 1-10). 

As per claim 6, Ladd et al (6269336) teaches: 

"directing the remote application server to perform an arbitrary routine on behalf 
of the portal in response to a predetermined input" as the application server is configured 
to allow the communication node to access information (col. 10 lines 61-66), and based 
on the information from the communication nodes, the application server redirects the 
information to a VRU server, and allows the VRU server decide what type of speech 
processing to perform (col. 1 1 lines 1-10). 

As per claim 7, Ladd et al (6269336) teaches: 

"directing the portal to perform an action in response to a predetermined input" as 
electronic network (portal) to access the VRU server when necessary (col. 10 line 67 - 
col. 1 1 line 1 1). 

As per claim 9, Ladd et al (6269336) teaches: 
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"a voice gateway to connect a call to the portal" as voice gateway (PSTN, carrier 
switch, Fig. 3, subblock 210). 

As per claim 10, Ladd et al (6269336) teaches: 

"that when a caller requests access to the application server, the voice gateway 
connects the call to the application server and breaks the connection between the call and 
the portal" as the electronic network (portal) contains a two choice path, a first path for 
recognized subscribers (col. 6 lines 37-50) and a second path for non-subscribers or non- 
recognized subscribers (col. 6 lines 50-65). When it is established that the user is 
recognized, the first path is chosen, the caller is in direct contact with the application 
server within the communication node (col. 6 lines 45-50, and the user is not connected 
directly (i.e., bypasses) to the part of the electronic network (portal) that perform 
personnel identification, speech command, or etc. the second path, for non-subscribers, 
maintains the connection between the portion of the portal that perform user 
identification, and if the identification is unsuccessful, the user is routed to a customer 
service representative (col. 6 lines 62-64). 

As per claim 1 1 , Ladd et al (6269336) teaches: 

"the portal includes a speech recognizer" as VRU server/client (Fig. 3, subblock 

234). 



As per claim 12, Ladd et al (6269336) teaches: 
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" the response to an input being recognized as corresponding to the augmenting 
grammar set, control of the call is transferred from the application server to the portal 55 as 
transferring control back to the voice browser (col. 14 lines 29-35). 

As per claim 13, Ladd et al (6269336) teaches: 

"the call being transferred to another application server in response to recognizing 
a predetermined input as corresponding to the augmenting grammar set 55 as transferring 
call control to a different (and appropriate server); (col. 8 lines 35-45 - Ladd's call 
control transfers the call's according to input (either pages or message) and routes items 
either thru a paging network or email network). 

As per claim 14, Ladd et al (6269336) teaches: 

"the application server performs one of a fixed set of pre-determined actions on 
behalf of the portal in response to a predetermined input which is recognized as 
corresponding to the augmenting grammar set 55 as application server (Fig. 3, subblock 
242) instructing the VRU server to perform basic recognized functions such as automatic 
speech recognition (ASR's), text to speech (TTS), etc., (col. 9 lines 1-10). 

As per claim 1 5, Ladd et al (6269336) teaches: 

"the application server performs an arbitrary routine on behalf of the portal in 
response to a predetermined input which is recognized as corresponding to the 
augmenting grammar set 5 ' as the application server is configured to allow the 
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communication node to access information (col. 10 lines 61-66), and based on the 
information from the communication nodes, the application server redirects the 
information to a VRU server, and allows the VRU server decide what type of speech 
processing to perform (col. 1 1 lines 1-10). 

As per claim 16, Ladd et al (6269336) teaches: 

"the portal performs a predetermined action corresponding to an input which is 
recognized as corresponding to the augmenting grammar set" as electronic network 
(portal) to access the VRU server when necessary (col. 10 line 67 - col. 1 1 line 11). 

As per claims 1 7,25-27, Ladd et al (6269336) teaches a method comprising: 

"connecting a call to a portal" as voice gateway (PSTN, carrier switch, Fig. 3, 
subblock210); 

"requesting services of a remote application server via the call" as after entering a 
dialogue with the user, the user can choose from a variety of information (col. 6 lines 44- 
50). Wherein the communication node (212) is remotely located (col. 7 lines 24-32); 

"transmitting an augmenting grammar set from the portal to the remote 
application server" as VRU client sends the user grammar information to the VRU server, 
which accesses the remote database 244 containing speech information (col. 8 lines 55- 
61); 
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"connecting the call to the remote application server" as LAN connects and routes 
the speech results to the call control unit, application server, and voice browser (col. 8 
lines 63-65); 

"breaking the connection between the call and the portal" as communication 
server 212 perform the functions of the output signal(col. 8 lines 63-67), wherein the 
electronic network (portal) contains a two choice path, a first path for recognized 
subscribers (col. 6 lines 37-50) and a second path for non-subscribers or non-recognized 
subscribers (col. 6 lines 50-65). When it is established that the user is recognized, the 
first path is chosen, the caller is in direct contact with the application server within the 
communication node (col. 6 lines 45-50), and the user is not connected directly (i.e., 
bypasses) to the part of the electronic network (portal) that performs personnel 
identification, speech command, or etc. The second path, for non-subscribers, maintains 
the connection between the portion of the portal that perform user identification, and if 
the identification is unsuccessful, the user is routed to a customer service representative 
(col. 6 lines 62-64). 

"notifying the portal when an input during the call corresponds to the augmenting 
grammar set" as notification to the portal that the user has been verified and that the 
caller is in direct contact with the application server within the communication node (col. 
6 lines 44-50). 

As per claims 1 7,25-27, Ladd et al (6269336) does not explicitly teach the 
execution of the speech recognition outside of the portal, however, Maes (6801604) 
teaches the use of a second speech recognizer outside of the portal system (col. 4 lines 
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45-55), wherein the control is temporarily passed to the speech server to process the 
incoming data (Fig. 7, subblock 81, control from subblock 80 and data from subblock 
82), and then control passed back to the controller (fig. 7 between subblock 80 and 83, 
col. 15 lines 1-16; also note col. 1 1 lines 7-20 and col. 16 lines 45-60, detailing the 
temporary passage of control, and then back to the original program that had initial 
control) . Therefore, it would have been obvious to one of ordinary skill in the art of 
distributed speech applications to modify the teachings of Ladd et al (6269336) with 
remote speech processing along with control protocols because it would advantageously 
temporarily shift more intense speech processing to adequately prepared systems and 
then returning to the original program ( Maes (6801604), col. 24 line 55 - col. 25 line 46). 

The combination of Ladd et al (6269336) in view of Maes (6801604) does not 
explicitly teach transferring control to the second recognizer independent of the portal, 
however, Hedin et al (6185535) teaches transference of the recognition between different 
recognition engines without going through the portal (abstract, fig. 5, col. 2 line 43 - col. 
3 line 25; col. 8 lines 35-64). Therefore, it would have been obvious to one of ordinary 
skill in the art of speech recognition systems to modify the combination of Ladd et al 
(6269336) in view of Maes (6801604) with localized control of the recognition process 
because it would make the system more robust to handle increased volume of recognition 
requests (Hedin, col. 2 lines 26-35). 



As per claim 18, Ladd et al (6269336) teaches: 
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"reconnecting the call to the portal in response to recognizing a predetermined 
input as corresponding to the augmenting grammar set" as allowing the caller to have 
access to the electronic network (portal) after the recognition has been performed (col. 6 
lines 55-60). Ladd's recognition routine access a grammar set in the voice recognition 
process (col. 8 lines 55-67). 

As per claim 19, Ladd et al (6269336) teaches: 

"performing a predetermined action in response to an input which is recognized as 
belonging to the augmenting grammar set" as performing a dialogue with a recognized 
user (col. 6 lines 25-49). 

As per claim 20, Ladd et al (6269336) teaches: 

"a system for operating a speech recognition system, comprising" as the 
electronic network (fig. 3) containing voice/speech recognition capabilities (Fig. 3, 
subblocks 232,234). 

"means for augmenting the speech recognition system with an augmenting 
grammar set supplied by a portal" as the electronic network (synonymous with the term 
"portal" - portal is defined as a central starting point for users to access a wide variety of 
applications - see applicant's specification, under discussion of the related art) supplies 
the grammar as dictated by the user to upgrade the grammar set (col. 4 lines 32-35; col. 4 
line 62 - col. 5 line 19). 

"means for notifying the portal in response to an input which corresponds to the 
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augmenting grammar set" as communication node (Fig. 3, subblock 212) notifying the 
electronic network (portal) via the VRU server that recognition is performed (col. 8 lines 
55-65). 

As per claim 20, Ladd et al (6269336) does not explicitly teach the execution of 
the speech recognition outside of the portal, however, Maes (6801604) teaches the use of 
a second speech recognizer outside of the portal system (col. 4 lines 45-55), wherein the 
control is temporarily passed to the speech server to process the incoming data (Fig. 7, 
subblock 81, control from subblock 80 and data from subblock 82), and then control 
passed back to the controller (fig. 7 between subblock 80 and 83, col. 15 lines 1-16; also 
note col. 1 1 lines 7-20 and col. 16 lines 45-60, detailing the temporary passage of control, 
and then back to the original program that had initial control) . Therefore, it would have 
been obvious to one of ordinary skill in the art of distributed speech applications to 
modify the teachings of Ladd et al (6269336) with remote speech processing along with 
control protocols because it would advantageously temporarily shift more intense speech 
processing to adequately prepared systems and then returning to the original program 
( Maes (6801604), col. 24 line 55 - col. 25 line 46). 

The combination of Ladd et al (6269336) in view of Maes (6801604) does not 
explicitly teach transferring control to the second recognizer independent of the portal, 
however, Hedin et al (61 85535) teaches trasnference of the recognition between different 
recognition engines without going through the portal (abstract, fig. 5, col. 2 line 43 - col. 
3 line 25; col. 8 lines 35-64). Therefore, it would have been obvious to one of ordinary 
skill in the art of speech recognition systems to modify the combination of Ladd et al 
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(6269336) in view of Maes (6801604) with localized control of the recognition process 
because it would make the system more robust to handle increased volume of recognition 
requests (Hedin, col. 2 lines 26-35). 

As per claims 2 1 -24, Ladd et al (6269336) teaches the input corresponding to at least one 
DTMF tone (col. 2 lines 56-61) as well as spoken utterances (col. 4 lines 33-38). 

Response to Arguments 

3. Applicant's arguments filed 1 1/21/2006 have been fully considered but are moot in view 
of the new grounds of rejection. The Hedin reference still applies to the claim limitations, 
including the new claim limitations pertaining to performing the speech recognition at the 
secondary location (in fact, Hedin specifically shows this, as pointed to in the above rejection). 
Again, examiner notes that focusing on the control/transferring protocols may overcome the 
Hedin reference. As a further note, Examiner notes the Zhou (699993 1) reference; and Brown 
(6604075) - Fig 3 reference, teaching separate connection for recognition processes. 

Conclusion 



4. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure. Please see related art listed on the PTO-892 form. 
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5. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Michael Opsasnick, telephone number (571)272-7623, 
who is available Tuesday-Thursday, 9am-4pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examinees 
supervisor, Mr. Richemond Dorvil, can be reached at (571)272-7602. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 

MICHAEL OPSASNICK 
PRIMARY EXAMINER 
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primary examiner 
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